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Bacterial receptor structures 

The present invention relates to new bacterial re- 
r.ptor structures originating from natural bacterial re- 
5 ceptor structures which have been modified in regard to 
amino acid residues involved in the original interaction 
function, whereby said original interaction function has 
been substantially inhibited and replaced by a modified 
interaction function directed to a desired interaction 
10 partner. 

Several bacteria known to invade mammals have evolved 
surface proteins capable of binding to a variety of sub- 
stances including host specific carbohydrates and prote- 
ins. Several such receptors from Gram-positive bacterial 

15 pathogens have been isolated and characterized in detail 
as will be shown below. Most well-characterized are the Fc 
receptors, named after the capability of binding to the 
constant Fc part of IgG. Based on binding experiments to 
IgG from various mammalian sources, and subclasses there- 

20 of, Fc receptors have been divided into six types I -VI. 
The receptor- from S. aureus, protein A [SPA], defining- the 
type I receptor, has been the subject of immense studies. 

SPA binds IgG from most mammalian species, including 
man. Of the four subclasses of human IgG, SPA binds to 

25 IgGl, and IgG4 but shows very weak or no interaction with 
IgG3 [Eliasson, M. et al, 1989 J.Biol.Chem. 9:4323-4327]. 
This pseudoimmune reaction has been used for more than 20 
years for the purification and detection of antibodies in 
diagnostic, research and therapeutic applications- Clo- 

30 ning, sequencing and Escherichia coli expression of defi- 
ned fragments of the SPA gene revealed a highly repetitive 
organization, with five IgG binding domains [E-D-A-B-C], a 
cell wall spanning region and membrane anchoring sequence 
[XM] [Uhlen, M. et al, 1984 J.Biol.Chem. 259:1695-1702; 

35 Moks, T. et al, 1986 Eur . J.Biochem. 156:637-643]. A vast 
number of plasmid vectors have been constructed, allowing 
gene fusions to different fragments of the gene for the 
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purpose of fusion protein production in different hosts 
[Nilsson B. and Abrahmsen, L. 1990 Meth.Enz. 185:144-161] 
(Fig. 2a). 

The structure for a complex between human Fc [IgGl] 
5 and a single domain [B] of SPA has been determined by X- 
ray crystallography at a resolution of 2.8 A [Deisenhof er, 
J. et al 1981 Biochemistry 20:2361-2370]. Based on this 
structure and additional information from NMR experiments , 
the B domain can be viewed as a compact structure consis- 

10 ting of three anti-parallel cc-helices connected with 

loops. In the Fc binding, which is of both electrostatic 
and hydrophobic nature, only side chains of residues from 
helices 1 and 2 are involved, whilst the third helix is 
not participating in the binding. Based on this domain B, 

15 a synthetic IgG-binding domain [Z] [Nilsson, B. et al 1987 
Prot.Eng. 1:107-113] has been constructed, suitable as fu- 
sion partner for the production of recombinant proteins 
which allows purification by IgG affinity chromatography. 
The high solubility and the stable structure of the Z 

20 domain has been utilized for production, purification and 
renaturation of a large number of recombinant proteins. 
[Josephsson, S. and Bishop, R. Trends Biotechnol. 6:218- 
224; Samuelsson, E. et al 1991 Bio.Technol. 9:363-366] 

Streptococcal strains of serological groups C and G 

25 display a binding repertoire for mammalian IgGs, including 
human IgG3, which is even broader than for the type I re- 
ceptor. The name protein G was suggested for this type III 
receptor from group G streptococci. In 1986 Olsson and co- 
workers reported on the cloning and sequencing of the gene 

30 from the serological group G streptococci [G148] [Guss, B. 
et al, 1987 EMBO J. 5:1567-1575; Olsson, A. et al, 1987 
Eur. J.Biochem. 168:319-324]. In analogy with SPA is SPG a ' 
repetitively arranged molecule, comprising an IgG-binding 
region of three homologous domains [C1,C2,C3], spaced by 

35 smaller D-regions (Fig. 2A). Compared to SPA, SPG displays 
a different binding spectra for immunoglobulins from dif- 
ferent species and subclasses thereof. The IgG binding 
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domains of protein G are now widely used as an immuno- 
logical tool, i.e. in the affinity purification of mono- 
clonal antibodies. Production of subfragments constructed 
by DN A -technology, have shown that an individual C-region 
5 is sufficient for full IgG-binding. Recently, the struc- 
ture for a complex between the CI -domain from SPG and hu- 
man Fc was determined with X-ray crystallography (Fig. 
2B). This shows that SPG binds to the CH2-CH3 interface 
but at a different site compared to SPA. The binding is 
10 mainly of electrostatic nature which is in contrast to the 
large contribution of hydrophobic forces seen for the SPA- 
Fc interaction. Moreover, the 3-D structure of CI differs 
from the X structure in that it is built up by two P- 
sheets connected by an a-helix [PP-a-PP]. The residues of 
15 CI which according to the structure are involved in the 
binding, corresponds to the a-helix, the loop and the 
following 0-sheet. 

An additional activity of SPG is the capability to 
bind serum albumin. The binding strength is species depen- 
20 dent, and among the tested samples, SPG binds strongest to 
serum albumin from rat, man and mouse. Production and bin- 
ding studies of subfragments of SPG shows that the two 
binding activities are structurally separated and that the 
serum albumin binding function is located at the repeti- 
25 tive A-B region [Nygren et al 1990 Eur . J.Biochem. 193:143- 
148] . This region has been used for several biotechnolo- 
gical purposes. Recombinant proteins have been produced as 
fusions to the region which enables the purification by 
affinity chromatography, where human serum albumin most 
30 frequently has been used as immobilized ligand. Proteins 
found to be proteolytically sensitive have been produced 
as "dual affinity fusions" in which they are flanked by 
two different affinity tails derived from SPA and SPG, 
respectively. Purification schemes employing both the N- 
35 and C-terminal are thus possible which ensures the recove- 
ry of an intact target protein [Hammarberg et al 1989 
Proc. Natl. Acad. Sciences USA 86:4367-4371]. The strong and 
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specific binding to serum albumin has also been used for 
the in vivo stabilization of therapeutic proteins. 

Through complex formation with the very long-lived 
serum albumin, the receptor is carried in the circulation 
5 (macaque apes) with a half -life which is close to the 
half -life for serum albumin itself- Studies in mice with 
the for HIV/ AIDS therapy interesting, but rapidly cleared 
T-cell receptor CD4, showed that it was substantially sta- 
bilized when fused to the serum albumin binding region, 

10 when compared with an unfused control protein [Nygren et 
al 1991 Vaccines 91 Cold Spring Harbor Press 363-368]. The 
slow clearance can probably be explained by the complex 
formation with serum albumin which circumvents elimination 
by the liver and excretion in the kidney. 

15 In order to determine the minimal extension required 

for maintained binding to serum albumin, several smaller 
fragments of the A-B region have been produced and ana- 
lyzed. The smallest fragment so far with serum albumin 
binding activity is a 46 residue fragment [ M B2A3"] compri- 

20 sing region A3 flanked by 13 and 9 residues, respectively , 
from regions B2 and S. 

Based on homology and binding studies of other par- 
tial fragments SPG is regarded to be trivalent with regard 
to binding to serum albumin. Similar to the monovalent 

25 IgG-binding domains Z and CI B2A3 is relatively small and 
shows high solubility and stability and is therefore a 
suitable candidate for modification. 

Summary of the invention 

30 The present invention has for its main purpose to 

provide new bacterial receptor structures by modifying 
natural bacterial receptors in regard to their original 
interaction functions to result in new structures having 
modified interaction functions. 

35 Another object of the invention is to provide arti- 

ficial bacterial receptor structures which are stable and 
more resistant to various conditions, such as increased 
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temperatures . 

Yet another object of the invention is to provide 
artificial bacterial receptor structures, the interaction 
functions of which have been modified to direct same to 
5 other desired interaction partners. 

With these and other objects that will be clear from 
the following disclosure in mind the invention provides 
for novel proteins obtainable by mutagenesis of surface- 
exposed amino acids of domains of natural bacterial re- 

10 ceptors said proteins being obtained without substantial 
loss of basic structure and stability of said natural 
bacterial receptors. Said proteins have preferably been 
selected from a protein library embodying a repertoire of 
said novel proteins. In such novel bacterial receptor 

15 structures, at least one amino acid residue involved in 

the interaction fuction of the original bacterial receptor 
has been made subject to substitution by another amino 
acid residue so as to result in substantial loss of the 
original interaction capacity with a modified interaction 

20 capacity being created, said substitution being made with- 
out substantial loss of basic structure and stability of 
the original bacterial receptor. 

It is preferred that said bacterial structures origi- 
nate from Gram-positive bacteria. Among such bacteria 

25 there may be mentioned Staphylococcus aureus, 

Streptococcus pyogenes [group A], Streptococcus group 
C,G,L, bovine group G streptococci, Streptococcus 
zooepldemicus [group C] , Streptococcus zooepldemicus S212, 
Streptococcus pyogenes [group A] , streptococci groups 

30 A,C,G, Peptostreptococcus magnus, Streptococcus agalactlae 
[group B] . 

Of special interest are thermophilic bacteria evolved 
to persist in environments of elevated temperatures. Re- 
ceptors from species like e.g. Bacillus stearothermo- 
35 philus, Thermus aquaticus, Thermococcus litoralis and 
Pyrococcus have the potential of being naturally excep- 
tionally stable, thus suitable for providing structural 



WO 95/19374 



PCT/SE95/00034 



6 

frameworks for protein engineering according to the in- 
vention. 

It is particularly preferred to use as a starting ma- 
terial for the modification of the interaction function 
5 bacterial receptor structures originating from staphylo- 
coccal protein A or streptococcal protein G. 

Among preferred receptors there may be mentioned bac- 
terial receptors originating from Fc[IgG] receptor type I, 
type II, type III, type IV, type V and type VI, fibronec- 
10 tin receptor, M protein, plasmin receptor, collagen re- 
ceptor, fibrinogen receptor or protein L [K light chains], 
protein H [human IgG] , protein B [human IgA,Al], protein 
Arp [human IgA] • 

Particularly preferred bacterial receptors originate 
15 from the Fc[ IgG] receptor type I of staphylococcal protein 
A or the serum albumin receptor of streptococcal protein 
G. 

In order to maintain stability and the properties of 
the original bacterial receptor structure it is preferred 

20 in accordance with the present invention that the substi- 
tution involving amino acid residues taking part in the 
interaction function of the original bacterial receptor 
does not involve more than about 50% of the amino acid 
residues of the original bacterial receptor- It is parti- 

25 cularly preferred that not more than about 25% of the 

amino acid residues of the original bacterial receptor are 
made subject to substitution. 

In regard to the original bacterial receptor struc- 
tures selected for modification of their interaction func- 

30 tions it is particularly preferred to use receptors origi- 
nating from the IgG-binding domains Z, CI, and the serum 
albumin binding domains B2A3. 

In order to maintain as far as possible the stability 
and properties of the original receptor structure subject 

35 to modification in accordance with the present invention 
it is preferred that substitution thereof involves not 
more than substantially all of the amino acid residues 
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taking part in the interaction function of the original 
bacterial receptor. 

In order to obtain favourable properties concerning 
stability and resistance to various conditions it is pre- 
5 ferred that the bacterial receptor according to the pre- 
sent invention is comprised of not more than about 100 
amino acid residues. It is known from scientific reports 
that proteins of a relatively small size are fairly resis- 
tant to increased temperatures and also to low pH and cer- 

10 tain chemicals. For details concerning temperature resis- 
tance c.f. the article by Alexander et al. in Biochemistry 
1992, 31, pp 3597-3603. 

With regard to the modification of the natural bacte- 
rial receptor structure it is preferred to perform the 

15 substitution thereof by resorting to a genetic enginee- 
ring, such as site-directed mutagenesis. 

With regard to the interaction partner of the modi- 
fied natural bacterial receptor a multitude of substances 
are conceivable, such as proteins, lipids, carbohydrates 

20 and inorganic substances. Among carbohydrates examples are 
blood group determinants and pathogen specific oligo- 
saccharides. 

In regard to proteins conceivable interaction part- 
ners are IGF-I, IGF-II, hGH, Factor VIII, insulin and 

25 apolipoprotein and their respective receptors as interac- 
tion partners. Furthermore, by selecting new receptor 
variants with specificity for different folding forms of 
proteins, affinity resins or analytical tools to facili- 
tate the isolation of correctly folded molecules can be 

30 produced. Further examples are viral coat proteins, bac- 
terial antigens, biotin and cell markers, such as CD 34 
and CD 4. 

Although the present invention is applicable to a 
variety of natural bacterial receptors the following 
35 illustration of the invention more in detail will be 

directed to the use of the IgG-binding domains Z, CI and 
B2A3. The concept of the present invention residing in the 
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use of artificial bacterial receptors based on the natural 
structures of naturally occurring bacterial receptors is 
associated with several advantages. Thus, the invention 
makes it possible to use robust and stable, highly soluble 
5 and secretion competent receptors. This is in contrast to 
previous techniques based on the use of polyclonals and 
monoclonals, such as for diagnostic purposes/ which are 
not very stable in connection with storage, varying condi- 
tions, such as varying temperatures etc* Furthermore, the 

10 invention makes it possible to modify natural bacterial 
receptors to obtain desired interaction capacities for 
specific purposes. 

For the selection of such functional variants in a 
large repertoire, a powerful selection system must be 

15 employed. Recent developments in this field offer alter- 
native methods. One of the most important tools for pro- 
tein engineering that has emerged during the last years is 
the phage display of proteins. By recombinant DNA tech- 
niques, single phage particles can be prepared which on 

20 their surface carries a protein fused to a phage-coat 
protein. By panning from a large pool of phages bearing 
different proteins, or variants of a specific protein, 
specific phage clones can be sorted out, which displays a 
certain binding characteristic [WO92/20791 to Winter et 

25 al]. Since the phage particle contains packed DNA encoding 
the phage protein components, a coupling between the spe- 
cific variant of the displayed protein and the correspon- 
ding genetic information is obtained. Using this techni- 

g 

que, typically 10 phage clones can simultaneously be 
30 generated and subjected to panning for screening of de- 
sired characteristics. The phage display technique can be 
used for selection of both small peptides as well as more 
complicated proteins such as antibodies, receptors and 
hormones. For selection of proteins which cannot be se- 
35 creted, which is a prerequisite for phage display, intra- 
cellular systems have been developed in which the library 
of proteins are fused to a repressor protein with affinity 
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for a specific plasmid-borne operator region resulting in 
a coupling between the specific protein variant and the 
plasmid that encoded it. An alternative to the phages as 
bearer of protein libraries would be to use bacterial 
5 cells- Recently, display of recombinant proteins on the 
surface of Staphylococcus xylosus based on fusions to the 
cell-wall anchoring domain was demonstrated, which opens 
the possibility of display also of repertoires of proteins 
for affinity selection of specific variants [Hansson, M. 

10 et al 1992 J. Bacteriology 174:4239-4245]. Furthermore, by 
structure modelling using computer graphic simulations, 
predictions of the binding and function of altered vari- 
ants of a protein can theoretically be done before the 
construction of the gene encoding the protein. 

15 As indicated above the present invention describes 

the construction of novel proteins based on the mutagene- 
sis of surface exposed amino acids of domains derived from 
bacterial receptors. These artificial bacterial receptors 
can be selected for different applications using a phage 

20 display system. The benefits from using bacterial recep- 
tors as structural frameworks are several. They have evol- 
ved to express a binding function without disturbing the 
overall structure. They are naturally highly soluble, 
robust to unphysiological conditions, such as pH and heat, 

25 folding efficient and are in addition secretion competent. 

The invention finds use in several different areas. 
The introductory part of the above-identified patent 
specification WO92/20791 gives an excellent survey on an- 
tibodies and their structure. Reference is particularly 

30 made to page 1 thereof. 

The bacterial receptors SPA and SPG have been widely 
used in antibody technology for detection and purification 
of antibodies from e.g. hybridom supernatants and ascites 
fluids. However, not all antibodies are recognized by 

35 these receptors, depending on species and subclass. For 
the smaller subfragments of antibodies (Fig. 4), SPA and 
SPG show a limited binding, and efficient tools for gene- 
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ral purification schemes are lacking. However, from a 
repertoire of mutant receptors including SPA and SPG, 
forms displaying a broader affinity for antibodies and 
subfragments thereof can potentially be selected. 
5 The complex structural organization of antibodies has 

a number of consequences for their use in different appli- 
cations as well for the production of recombinant deriva- 
tives. For use in immunosorbents, the arrangement of sub- 
units connected by disulphide bonds can lead to a leakage 

10 of released heavy and light chains from columns. The re- 
quirement of successful docking of the two subunits con- 
tributing to the antigen binding site makes the production 
in bacteria of small subfragments with a low association 
difficult. The folding of the antibody is dependent on the 

15 formation of intra- and inter chain disulphidebonds, which 
are not able to form in the intracellular environment of 
bacterial cells. High-level intracellular expression sys- 
tems for recombinant antibodies leads to the formation of 
inclusion bodies, which have to be renatured to obtain 

20 biological activity. These limitations make it worthwhile 
to search for alternatives for use as protein domains cap- 
able of specific binding, to replace antibodies in a vast 
number of applications. 

The CDR regions forming the antigen bidning part of 

25 an antibody forms a total surface available for the anti- 
gen of approximately 800 A 2 , with typical 10-20 residues 
from the antibody involved in the binding. Using the 
structure of the complex determined by X-ray crystallo- 
graphy between an individual domain B of SPA and human 

30 fc[IgGI] as a starting point about 15 amino acids of the 
said domain involved in this binding can be determined or 
postulated. The binding surface of about 600 A 2 is of the 
same order of magnitude as between an antibody and its 
antigen. By arbitrary in vitro mutagenesis of these posi- 

35 tions simultaneously there is obtained a large library of 
Z variants with modified functional properties. In view of 
the fact that the regions of the Z domain constituting the 
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very stabile so called three-helix bundle is maintained in 
its native form spectra of proteins are generated which 
could be considered as "artificial antibodies" and which 
have the expected high solubility and excellent folding 
5 properties capable of binding to a large number of new 

ligands. Fusions of these artificial receptors to constant 
regions can be constructed to recruite effector functions, 
such as complement binding or triggering of ADCC ( antibody 
dependent cellular cytotoxicity). 

10 There are several potential advantages of utilizing 

the SPA structure [Z] as a starting point for such "arti- 
ficial antibodies" or artificial bacterial receptors. For 
a period of about 10 years a large number of proteins have 
been produced as fusions to SPA, where one has utilized 

15 the unique properties of the fusion partner in expression, 
refolding and purification. In these applications the Z 
domain has been found to be extremely soluble, stable 
against proteases, easy to produce in large amounts and 
foldable to a correct structure also intracellularly in 

20 Escherichia coli (no cysteins). Immunoglobulins (Ig:s) are 
substantially tetramers built up from so called 0-sheet 
structures which stabilize the orientation of the antigen- 
binding loops which in turn consist of continuous peptide 
sequences. This is to be compared to the monomeric Z do- 

25 main built up from so called three-helix bundle consisting 
of three closely packed a-helix structures, where the Fc- 
binding amino acids are found discontinuously in the se- 
quence but in the folded protein are positioned on one and 
the same binding surface. This difference with regard to 

30 the structural elements contributing to the formation of 
the binding surface enables new possible conformations 
which cannot be obtained in natural antibodies. The abili- 
ty of Z to be folded to the native structure also under 
conditions prevailing in the site of cytoplasma opens the 

35 possibility of using also derivatives thereof clinically. 
Genes coding for artificial antibodies with for example 
virus-neutralizing capacity can be distributed to cells 
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through so called gene therapy resulting in interrupting 
the infection at an early stage. 

From structure data for the complex between an indi- 
vidual Ig-binding domain [CI] of SPG and human Fc the 
5 binding surface can be studied. The binding which is es- 
sentially of an electrostatic nature involves side chains 
from amino acids from the a-helix as well as from the sub- 
sequent 0-sheet [#3]. These differences in structure com- 
pared to the Z domain makes it useful also to create a 

10 library of CI variants to investigate whether differences 
in binding patterns for artificial antibodies can be ob- 
served depending on the different conditions as regards 
the topology of the binding surface. Repertoires based on 
the structures of these and other receptors therefore of- 

15 fer different possibilities in the creation of artificial 
forms with novel functions. 

When producing recombinant proteins the purification 
of the product is frequently a major problem. By expres- 
sing the target protein as a fusion to a so called affi- 

20 nity tail the hybrid product can effectively and selec- 
tively be recovered from the cell lysate or in certain 
cases from the culture medium by passage through a column 
containing an immobilized ligand. Several such gene fusion 
systems have been described which are based on the inter- 

25 action of a certain protein with a ligand. For industrial 
applications it is often desirable to clean effectively 
the columns between the runs to satisfy purity require- 
ments by authorities. Depending on the nature of proteins 
the relatively harsh conditions (NaOH, acids, heat) often 

30 used for organic or physical matrises, for example in ion 
exchange chromatography and gel filtration, can normally 
not be used. Here the use of new ligands based on stable 
structures originating from bacterial receptors are of 
great importance. In this connection the Z domain from SPA 

35 is an excellent example since said domain can be subjected 
to such difficult conditions as a pH of 1 or heating to 
80 °C without denaturating non-reversibly (see Example 2 
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below). From the library of for example Z variants inte- 
resting protein products can be selected for use immobi- 
lized on a solid phase for affinity chromatography. These 
protein ligands are resistant, to effective purification 
5 conditions and are therefore useful repetitively on a 

large scale. In traditional immuno affinity chromatography 
where immobilized monoclonal antibodies are used for the 
selective purification of a certain product there are 
problems with leakage from the column of subunits (heavy 
10 and light chains) of the antibody since it consists of 
four polypeptide chains linked by cystein bridges. Since 
the artificial bacterial receptors consist only of one 
polypeptide chain this problem will be avoided. One par- 
ticular area of interest is selection for binding to car- 
15 bohydrates. Lectins, nature's binders to this large and 
important group of biomolecules, have been found to be 
difficult to purify and have limited stability. Since the 
generation of antibodies against carbohydrates has been 
found to be quite complicated a selection of new arti- 
20 ficial lectins will be of great importance to research, 
diagnostics and therapy. 

In the production of recombinant proteins in bacte- 
rial hosts precipitates of the gene product are frequently 
formed, so called inclusion bodies. In order to obtain a 
25 native structure of the protein this must be subjected to 
refolding in vitro. A limitation in such process one is 
often confronted with is the f*ct that a great part of the 
material precipitates in the procedure which results in 
low yields. By producing the protein with an extension in 
30 the form of either a short hydrofilic peptide or an easily 
soluble complete domain [Samuelsson, E. et al 1991 
Bio/Technol. 9:363-366] substantially higher concentra- 
tions of the protein will be obtained without precipita- 
tion taking place during renaturation. For example the 
35 high solubility of the said domain enables the use of in- 
creased solubility of proteins in either refolding from 
inclusion bodies or in so called reshuffling of disulphide 
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bridges. From libraries of artificial receptors new forms 
can be selected having improved properties to facilitate 
and even make refolding of recombinant proteins possible 
(cis-acting chaperones). 
5 Recently a new unit operation for the purification of 

recombinant proteins based on ion exchange chromatography 
in so called expanded bed has been described [Hansson, M. 
et al 1994 Bio/Technol. in press]. In this connection 
there is used a difference in isoelectric point between 

10 the target protein and the proteins of the host cell for 
selective enrichment on a positively charged ion exchange 
matrix. By fusion to the acid Z domain (pi 4.7) the ion 
exchange step can take place at a pH, at which the majori- 
ty of the contaminants were of the opposite charge compa- 

15 red to the fusion protein. By constructing libraries of 
bacterial receptors where a selection of amino acids have 
been replaced by the acid amino acids aspartate and gluta- 
mate also very acid and solubility increasing domains can 
be produced for use as fusion partners in the production 

20 of recombinant proteins. 

As previously described affinity systems based on 
protein ligands are not totally suitable for industrial 
purposes in view of the harsh conditions required in the 
cleaning of columns. Therefore, there is a need for fusion 

25 partners having specific affinity towards simple and cheap 
organic ligands. Panning of phage display libraries of 
different bacterial receptors against such ligands provide 
novel affinity tails suitable for the use as fusion part- 
ners for the production purification of recombinant pro- 

30 teins. 

The present invention provides means for producing 
and selecting proteins with novel functions. According to 
the invention this is achieved by extensive mutation of 
defined residues of stable domains of bacterial receptors. 
35 Due to the novel functions of the artificial bacterial re- 
ceptors, these can be used as specific binders for thera- 
peutic, diagnostic, biotechnology or in research. 
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Figure 9. Phagemid vector pKNl. The library PCR products 
encoding the variegated helices 1 and 2 (both 
the acid and the extensive library) was subclo- 
ned into the phagemid vector, pKNl, containing 
5 the gene for residues 44-58 of the wild type 2 

domain (essentially helix 3), followed by the 
gene for a 46 residues serum albumin binding 
region (ABP) derived from streptococcal protein 
G linked in frame with a truncated version of 
10 the M13 phage coat protein 3 gene. The phagemid 

contains the origin of replication derived from 
plasmid pBR322 as well as the intergenic region 
(fl ori) required for packing into the phage 
particles. 

15 

Figure 10. SDS-PAGE. HSA-affinity purified proteins from 
the periplasm of Escherichia coll cells produ- 
cing the wild type Z domain and two different 
acid Z -variants as ABP fusion proteins encoded 
20 from their respective phagemid vectors were 

analyzed by SDS/PAGE. M, molecular weight mar- 
ker; lane 1, wild type Z domain; lane 2, clone 
10; lane 3, clone 12. 

25 Figure 11. CD-data. Overlay plot of CD spectra obtained 

for the wild type Z domain and two variants of 
the Z -protein library. The signals of the pro- 
teins were obtained after subtraction of the CD 
signal contribution of the ABP tail, present 

30 during the analysis. 

Figure 12. Ion exchange chromatography. The two acid Z- 
variant proteins no. 10 and no. 12 together 
with the wild type Z-domain (produced as ABP 
35 fusion proteins) were each subjected to analy- 

sis at pH 5.5, employing an anion exchange 
chromatography column. Elution of the proteins 
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from the column was obtained by a NaCl gradi- 
ent. Top: acid 2- variant no. 12; middle, acid 
Z-variant no. 10; bottom, Z (wild type). Note 
that the wild type Z protein was not retarded 
5 on the column at this pH. 



Figure 13. Z-domain structure. A main-chain trace repre- 
sentation of the model of the structure of the 
native Z-domain. The structure of helices one 

10 and two are from the co-crystal structure be- 

tween domain B of SPA and Fc (Deisenhofer, 
(1981) Biochemistry, 20, 2361-2370). The third 
helix was built based on the secondary structu- 
re assignments from NMR spectroscopy (Gouda et 

15 al., (1992) Biochemistry, 31, 9665-9672). Non- 

hydrogen atoms of side-chains of residues that 
were mutated in the construction of the combi- 
natorial library are displayed as ball-and- 
stick models. The display was generated by the 

20 program MOLSCRIPT (Kraulis (1991) J.Appl. 

Cryst., 24, 946-950). 



Figure 14. Amino acid sequences. Result from DNA-sequen- 
cing of 31 randomly chosen Z-variants from the 

25 library. The residues subjected to the mutage- 

nesis are boxed. Horisontal lines indicate nu- 
cleotide identity with the wild type Z sequence 
listed at the top. Indicated are the clones 
that were expressed and characterized as fusion 

30 proteins to the ABP-tail. 



Figure 15. Aminoacid distribution. Result from the statis- 
tical analysis of the deduced amino acids at 
the mutated positions. In total, 13 residues 
35 from 31 clones (403 codons) were included in 

the calculation. The ratios between observed 
and expected frequencies are shown for all 20 
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with the wild type Z-domain (produced as ABP 
35 fusion proteins) were each subjected to analy- 

sis at pH 5.5, employing an anion exchange 
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from the column was obtained by a NaCl gradi- 
ent. Top: acid Z -variant no. 12; middle, acid 
Z-variant no. 10; bottom, Z (wild type). Note 
that the wild type Z protein was not retarded 
5 on the column at this pH. 

Figure 13. Z-domain structure. A main-chain trace repre- 
sentation of the model of the structure of the 
native Z-domain. The structure of helices one 

10 and two are from the co-crystal structure be- 

tween domain B of SPA and Fc (Deisenhofer, 
(1981) Biochemistry, 20, 2361-2370). The third 
helix was built based on the secondary structu- 
re assignments from NMR spectroscopy (Gouda et 

15 al., (1992) Biochemistry, 31, 9665-9672). Non- 

hydrogen atoms of side-chains of residues that 
were mutated in the construction of the combi- 
natorial library are displayed as ball-and- 
stick models. The display was generated by the 

20 program MOLSCRIPT (Kraulis (1991) J.Appl. 

Cryst., 24, 946-950). 



Figure 14. Amino acid sequences. Result from DNA-sequen- 
cing of 31 randomly chosen Z- variants from the 

25 library. The residues subjected to the mutage- 

nesis are boxed. Horisontal lines indicate nu- 
cleotide identity with the wild type Z sequence 
listed at the top. Indicated are the clones 
that were expressed and characterized as fusion 
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tical analysis of the deduced amino acids at 
the mutated positions. In total, 13 residues 
35 from 31 clones (403 codons ) were included in 

the calculation. The ratios between observed 
and expected frequencies are shown for all 20 
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amino acids as well for the only termination 
signal (TAG) included in the NNG/T degeneracy 
prof ile. 

5 Figure 16. SDS-PAGE analysis. HSA-affinity purified pro- 
teins from the periplasm of E.coli cells produ- 
cing the wild type Z domain and four different 
Z-variants as ABP fusion proteins encoded from 
their respective phagemid vectors were analyzed 

10 by SDS/PAGE. Lanes 1-5: Reduced conditions. 

Lanes 6 and 7: Non-reduced conditions. Lane 1, 
wild type Z domain; lane 2, clone 16; lane 3, 
clone 21; lane 4, clone 22; lane 5, clone 24; 
M, molecular weight marker; lane 6, clone 16 

15 and lane 7 , clone 22. 



Figure 17. CD-data. Overlay plot of CD spectra obtained 

for the wild type Z domain and four variants of 
the cc-helical protein surface library. The sig- 
20 nals of the variants were obtained after sub- 

traction of the CD signal contribution of the 
ABP tail, present during the analysis. 



Figure 18. Biosensor assay* An overlay plot of sensorgrams 
25 obtained from the BIA-core™ analysis of the 

wild type Z domain and four different variants 
(no. 16,21,22,24; Figure 4) fused to the ABP 
tail. The IgG-binding activities of the diffe- 
rent proteins were analyzed using a sensor chip 
30 coated with approx. 5000 RU human polyclonal 

IgG and injections of 45 pi pulses at 2 pl/min 
of 1500 nM solutions of the different proteins. 
Note that the differences in plateau values of 
signals during the injections of the variants 
35 no. 16,21,22 and 24 is due to divergent dilu- 

tions into the driving buffer. 
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All reagents and DNA constructions are available at 
the department for Biochemistry and Biotechnology, Royal 
Institute of Technology, Stockholm, Sweden. 

5 Material 

The oligonucleotides (Fig. 6) were purchased from 
Scandinavian Gene Synthesis (Sweden), and phosphorylated 
where indicated according to [Maniatis e t al (1988) Mole- 
cular cloning. A laboratory manual. Cold Spring Harbor 

10 Laboratory Press]. ZLIB-1 was biotinylated in the 5' -end 
enabling immobilization on paramagnetic beads M-280 Strep- 
tavidin purchased from Dynal A/S (Norway). Washing/binding 
buffer was 1 M NaCI, 10 mM Tris-HCl, pH 7.5, 1 mM EDTA 
(ethylenediamine tetraacetic acid). The annealing/ligation 

15 buffer was 30 mM Tris-HCl, pH 7.5, 10 mM MgCl 2 , 0.2 mM 
ATP, 1 mM 1.4 dithiothreitol (DTT). DNA ligase were from 
Boehringer Mannheim, Germany. 10 x PCR buffer contained 20 
mM MgCl 2 , 2 mM dNTPs, 100 mM Tris-HCl, pH 8.3, 50 mM KC1, 
1% Tween 20. Tag DNA polymerase was from Cetus Inc., USA. 

20 The thermal cycler was a Perkin-Elmer 9600. For the tempe- 
rature/stability scanning a J-720 spectropolarimeter 
(JASC0, Japan) was used. Escherichia coll strain RR1AM15 
[Riither, U. (1982) Nucl. Acids Res. 10:5765-5772] prepared 
for competence [Maniatis et al (1988) Molecular cloning. A 

25 laboratory manual. Cold Spring Harbor Laboratory Press] 
was used as host for the transformation. Agar plates con- 
tained 100 pg/ml of ampicillin. 

EXAMPLE 1 

30 Construction of an acid Z-library 

The synthetic 58 residue SPA analogue Z [Nilsson et 
al, Prot.Eng] was subjected to a mutagenesis approach to 
construct new variants with an altered pi, in order to 
produce fusion partners for recombinant proteins to be pu- 

35 rified by ion-exchange chromatography. Based on the crys- 
tal structure of the complex between the B-domain of SPA 
and human Fcl [Deisenhof er, J. et al 1981, Biochemistry 
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20:2361-2370], five residues from the B-domain participa- 
ting in the binding were chosen as targets for mutagene- 
sis. These five codons corresponding to the Z-residues No. 
9, 11, 14, 27 and 35 positioned in helices 1 and 2 were 
5 altered simultaneously using degenerate oligonucleotides 
with the triplet sequence G(C/A)(C/A) at these positions 
resulting in the codons for the amino acids alanine (50%), 
aspartic acid (25%) and glutamic acid (25%), respectively. 

Using a solid phase gene assembly strategy [StAhl et al, 

5 

10 Biotechniques 14:424-434] a library of genes encoding 3 
(243) acidic variants of the synthetic IgG-binding Z- 
domain was created (Fig. 5). Twenty microlitres (200 pg) 
of paramagnetic streptavidin-coated beads were washed with 
washing/binding buffer and incubated with 15 pmole of pre- 

15 hybridized oligonucleotides ZLIB-1 (biotinylated) and 
ZLIB-2, for 15 min at RT at a final volume of washing/- 
binding buffer of 40 pi. After ligation and washing, ap- 
proximately 15 pmole each of the oligonucleotides ACID-1 
(degenerated), LONGBRIDGE, and ACID-2 (degenerated) and 

20 the preannealed linker pair ZLIB-4/ZLIB-5 were added in a 
repetitive manner, with intervening washing steps accor- 
ding to Stihl et al [Biotechniques 14:424-434]. After com- 
pleted assembly, the different fragments were ligated for 
15 min at 37 °C. To amplify the amount of DNA coding for 

25 the Z( Acid) -library still immobilized onto the beads, a 
fraction was withdrawn and subjected to PCR. The PCR mix 
(50 pi) contained one pmole each of PCR primers ZLIB-3 and 
ZLIB-5, 5pl each of the ligation mix, 10 x PCR buffer and 
10 x CHASE, 1 unit of Tag polymerase and sterile water to 

30 50 pi. The temperature cycling programme was: 96°C, 1 min, 
60 °C, 1 min and 72 °C, 2 min, repeated for 35 cycles. Ana- 
lysis by 1% agarose gel electrophoresis showed a band of 
the expected size of 179 bp, showing the feasibility of 
the assembly concept. The 179 bp band from the PCR of the 

35 Z( Acid) -library, was cut out from the gel and purified 
(Geneclean™, Bio 101, Inc. USA) prior to insertion in a 
plasmid vector ( TA-cloning™ kit, Invitrogen, Inc. USA) 
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suitable for solid phase DNA sequencing [Hultman et al, 
1988] . After transformation and spreading on ampicillin 
containing agarplates two colonies were chosen for analy- 
sis of the obtained sequences- The results (Fig. 6) show 
5 that the expected degeneracy was found at the desired 
positions. 

EXAMPLE 2 

Measurement of the temperature stability of the Z 

10 conformation 

The temperature stability of the Z conformation was 
determined by following the ellipticity at 222 nm by cir- 
cular dichroism (CD) spectroscopy through a temperature 
scan. This wavelength is used to monitor the presence of 

15 cc-helicity of Z [Cedergren et al. 1993 Prot. Eng. 6:441- 
448] . The experiment was performed at a rather low pH 
(approximately 2.9) in order to destabilize the molecule 
since the mid-point of temperature denaturation (Tm) is 
~95°C at neutral pH (data not shown), which is outside the 

20 range that can be determined by a complete scan through 
the transition under normal atmospheric pressure. The 
experiment shows (Fig. 4) that the Tm (as defined by the 
inflexion point of the temperature scan) of the Z domain 
is as high as 71 °C at pH 2.9. This demonstrates the ex- 

25 treme temperature stability of the a-helices of the Z 
molecule. 

The experiment was performed in a J-720 spectro- 
polarimeter (JASCO, Japan) and the temperature was con- 
trolled by circulating water through the cuvette holder 

30 from a NESLAB water bath. The temperature was monitored in 
the cuvette through a micro sensor device (JASCO, Japan). 
The buffer was 50 mM acetic acid, pH 2.9. The protein was 
domain Z [Cedergren et al 1993 Prot- Eng. 6:441-448] at a 
protein concentration of 50 pg/mL and the cuvette cell 

35 path length was 1 cm. The temperature scan speed in the 
experiment was 50°C/h. 
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Io * Characterization of proteins derived from the acid Z-library. 

example .) were restricted with 45 U Bp 31 (Labassco AB. S. eden) - >° ^ 
(Pharmacia, Sweden) in 200 ul buffer (33 mM Tris-acetate. pH 7.9. 10 mM M„ 
«r mM potaasium.ace.ate. 0.5 mM DTT and 0.1 mg/ml BSA). Tne m,x was 
^ with mineral nr. and incubated a. 37-C over nigh, The restricte fragment 
Z lima.ely 5 M were purified by phenol/cWoroformfisoamylalcohol extract 

ion a.'l5=C over nigh, „ Mfa I cleaved pKNl vecror (1 w) (see beiow 
sC 3 5 Weiss unirs of T4 DNA Ug.se. The ligation mixture was hearted a, 
7c for 20 min, exceed with phenol/cMorofcrm/isoamylalcchol followed by 
1^1 whh chloroform, erhano, precipirared and redissolved in 20 ul of s,en.e 



15 water. 



The phaeemid vecror pKNl (figure 9) was constructed in several s.eps as follows. A 
L b e sanded linker encoding .he invariant residues 44-58 of .he Z-domatn was 
formed from oligonucleotides ZLIB-6 and ZLIB-7 and cloned as .W^ 
20 fra2ment i„,o phagemid pKP9S6 (A kind gift from Dr. Lars Abrahmsen, Pharmac a 
Bioscience Cen.er. Sweden), resulting in pKN. Plasmid pKP9S6 encodes .he E co 
Omp A leader peptide followed by residues 249-406 of fd filamentous phage coa 
proTein-3 (LowmLro,. (1»1) 30. .0832-10844., under , he contro 

of a lac promo.er. A gene fragment encoding a monovalent serum albumm btndm 
25 re . ion d£ri ved from streptococcal protein G was amplified by PCR from .be plasmrd 
pB2T (Eliasson e. of.. Molecular 28, 1055-1061), using prtmers ; AB. •. 

and KB P-2 (which contain Xho I and Sal 1 recognition si.es. respectively) and Coned 
im o X„o 1 restricted p.asmid pKN, yielding pKNl. This phagemid vector thus 
t „codes for the Omp A signal peptide, the .bird belix of the wtld type Z domun 
30 foucwed by a 46 residue albumin binding protein (ABP) linked to res.dues ,49-406 
of fd phaa. pro.=in in and is adap.ed for insertion of Er P 3I/M.e 1-diges.ed PCR 
products encoding variegated helices one and two of the Z domam. 

Freeze comoeten. E. coll RR1AM15 (rup£44 lacYl lacZara-H gclKl xyf-5 tmM 

(,982) Sucleic Acids Research. 10, 5765-5772) ceUs were transformed w,.h _.h« 
Nation mixture according to Maniatis and coworkers (Maniatis e, at. (198.) 
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Molecular cloning: A Laboratory Manual Cold Spring Harbor, Cold Spring Harbor 
Laboratory Press) and plated on agar plates containing 100 ug/ml ampicillin (Sigma, 
USA) and 19© glucose. Small amount of cells from randomly picked colonies were 
separately subjected to two-step PCR amplifications (30 cycles: 96°C, 15 s; 72 C C, 2 
5 minj on a GeneAmp PCR System 9600 (Perkin Elmer, USA), using 5 pmoles of 
primers RIT-27 and NOKA-2 (biotinylated) in 20 mM TAPS (pH 9.3), 2 mM MgCb, 
50 mM KC1, 0A% Tween 20, 0.2 mM deoxyribonucleoside triphosphates (dNTPs) 
and 1.0 U of Tag DNA polymerase (Perkin-Elmer). The solid-phase DNA sequencing 
of the PCR products was performed employing the FITC labeled sequencing primers 
10 XOKA-3 (for the immobilized strand) and ABP-2 (for the eluted strand) on a robotic 
workstation (Biomek™ 1000, Beckman Instruments, Fullerton, CA) and an 
Automated Laser Fluorescent (A.L.F.) DNA Sequencer™ (Pharmacia Biotech, 
Sweden) as described by Hultman and coworkers (Hultman ex aL % (19S9) Nucleic 
acids Research, 17, 4937-4946). 

15 

Two clones with the different encoded acid aminoacid substitutions (bold face) at the 
positions 9, 1 1, 14, 27 and 35 in the Z-domian according to table 1 were selected for 
further analysis. The wild type Z domain and the two different acid Z-variant proteins 
^ (clones no. 10 and 12) were expressed from their respective phagemid vectors as 
fusions to the serum albumin binding tail (ABP) and purified by human serum 
albumin-affinity chromatography. 



25 Table 1. Amino acid substitutions for selected clones in the acid Z-library a . 



Clone no. Encoded aminio acid at position no. 

9 11 14 27 35 



w.t. 


Gin 


Asn 


Tyr 


Axg 


Lys 


10 


GIu 


Asp 


Asp 


Ala 


Glu 


12 


Glu 


Asp 


Asp 


Ala 


Ala 



0, Letters in bold face indicate acid crr.inoacics 

35 
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Colonies of E. coli RR1AM15 cells harbouring the corresponding phagemid vectors 
were used to inoculate 100 ml of Tryptic Soy Broth (Difco), supplemented with 
ampicillin (100 ug/ml). The cultures were grown at 37 e C to an OD 60 onm=l. followed 
by induction with a final concentration of 1 mM IPTG and incubation at 30 e C over 
5 nicht. The cells were harvested by centrifugation at approximately 5000 g for 10 min 
and periplasmic proteins released by osmotic shock. The periplasm* content from the 
cells was subjected to affinity chromatography on HSA-Sepharose as described by 
Nygren and coworkers (Nygren et ai, (1988) J. Mol. Recognit., 1, 69-74) and 
analyzed by SDS/PAGE on a homogeneous 12% slab gel (BioRad Inc., USA), which 
10 was stained with Coomassie Brilliant Blue R-250. For all proteins appr. 1.5-2.5 mg/L 
culture could be recovered, indicating similar production and secretion efficiencies 
for the variants and the wild type domain. In addition, the results from the SDS- 
PAGE analysis (Figure 10) of purified proteins suggest that the acid Z variants 
analyzed are stably expressed in E. coli. 



15 



To investigate if the secondary structure content of the derivatives was preserved 
after the surface mutagenesis, a subtractive circular dichroism analysis was 
performed. IgG- or HSA-affinity chromatography purified proteins Z, Z-ABP, the 
acid derivatives no. 10 and 12 fused to the ABP tail as well as the ABP-tail itself 

20 were subjected to a 250 to 184 nm (far UV) circular dichroism analysis at room 
temperature using a J-720 spe.ctropolarimeter instrument (JASCO, Japan). The 
scanning speed was 10 nm/min. The cell pathlength was 1 mm. Solutions 
(approximately 0.1 mg/ml) of the different proteins were prepared in 20 mM 
phosphate buffer pH 6.5, supplemented with 0.05 % Tween 20 (Kebo AB, Sweden). 

25 Accurate protein concentrations were determined by amino acid analysis on a 
Beckman 6300 amino acid analyzer equipped with System Gold data handling 
system. CD signals for the derivatives were obtained by subtracting the signal 
obtained for the ABP tail, after adjustments for differences in protein concentrations, 
followed by normalization for amino acid contents. 

30 

A comparison of signals obtained from 250 to 184 nm for the wild type Z domain and 
the acid variants fused to the ABP-tail was performed after subtraction of the 
contribution from the ABP tail itself. The result shows that for the two acid Z- 
derivatives, spectra similar to the wild type Z domain were obtained with a 
35 characteristic minimum at 208 nm and an inflexion point at 222 nm (Johnson, 1990) 
(Fisure 1 1). This suggests that the three helix bundle framework is preserved in these 
mutants. 
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The two Z- variants, no. 10 and 12, contain four and three introduced acid aminoacids, 
respectively, compared to the native Z domain. In order to investigate if the 
introduced acidity was reflected as differences in their isoelectric points, they were 
subjected to a gradient elution from an anion exchange column. The proteins Z(wild 
5 type) and the acid variants no. 10 and no. 12 (all produced as ABP fusion proteins) 
were each (5 ^g) dissolved in 300 ul of 20 mM Piperazine buffer (pH 5.5) and 
separately applied at 100 ul/min on aMonoQ, PC 1.6/5 column (Pharmacia, Sweden). 
Elution of the proteins were performed by applying a NaCl gradient in Piperazine 
buffer (pH 5.5) (Sigma, USA) ranging from 0-50 % NaCl in 20 min. The results from 
10 the analysis (figure 12) shows that the two acid Z.-variant proteins were eluted at 
different NaCl concentrations suggesting clear differences in isoelectric points. In 
contrast, at the pH chosen during the experiments, the wild type Z-domain did not 
interact with the resin, and was therefore seen in the flow-through. 

15 Thus, the series of experiments performed on the two acid Z- variant proteins shows 
that the expression behaviour, proteolytic stability and secondary structure content of 
the variants were unchanged when compared to the native Z-domain. Furthermore, a 
novel functions were introduced into the two Z-variants by the substitution of surface 
located positions with acid amino acids. The two acid variants can be used e.g. as 

20 fusion partners to facilitate purification of recombinant proteins by ion exchange *' 
chromatography at low pH. Thus, it is showed that among the members of the acid Z- 
library, variants with novel functions can be isolated. 



25 Example 4. Construction and characterization of a combinatorial 
library of Z-variants. 

A library of Z-variants was assembled using a solid-phase gene assembly strategy 
(see example 1). Most of the amino acid residues suggested to take pan in the binding 

3 q to Fc (Deisenhofer, (1981) Biochemistry, 20, 2361-2370) were found to be on the 
molecule surface (Q9, Q10, Nil, F13, Y14, L17, N28, Q32 and K35), and therefore 
included in the mutagenesis. In addition, based on their surfacial location, other 
residues (HI 8, E24, E25 and R27) were also decided to be included. In total, 13 
residues in the Z scaffold where thus chosen for simultaneous and random 

22 mutagenesis. A set of oligonucleotides (Figure 6) were synthesized for construction 
of the library of surface mutants of the 58-residues monovalent IgG-binding domain 
denoted Z. In this library, the codons for Q9, Q10, Nil, F13, Y14, L17 and HIS 
located in the first a-helix and E24, E25, R27, N28, Q32 and K35 in the second a- 
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helix of the Z domain (Figure 13) were substituted for degenerate NNK (K=G or T) 
codons using a solid phase strategy utilizing the single stranded degenerate 
oligonucleotides for the assembly. The chosen NNK degeneracy includes 32 codons 
covering all 20 amino acids, including the' TAG (amber) termination signal. 

5 

Oligonucleotide ZLIB-1 was synthesized with a 5* biotin group to enable robust 
anchoring onto streptavidin-coated paramagnetic beads used as solid support during 
the sene assembly. This ZLIB-1 oligonucleotide, together with its complementary 
sequence (ZLIB-2) encodes residues 1-8 of the Z domain, preceeded by the first six 

10 residues of region E of protein A which were included to facilitate the E. coli 
secretion of the Z variants (Abrahmsen ex al, (1986) EMBO J., 4, 3901-3906). The 
oligonucleotides DEGENM and DEGEN-2 (Table I) encode the two mutated heiices 
of the Z domain, respectively, normally involved in Fc-binding. Theoretically, full 
and simultaneous NNK degeneracy at the 13 selected positions would yield # a 

15 combinatorial library of appr. 8-10 16 protein variants encoded by 3.7*10 19 different 
DNA sequences. However, here the assembly of the library was initiated by the 
immobilization of appr. 15 pmole of prehybridized oligonucleotides ZLIB-1 and 
ZLIB-2 (Figure 6), which limits the theoretical size of the Z-library to appr. 0.9* 10 13 
different DNA sequences encoding appr. 2-10 10 Z variants. The assembly was 

20 continued by the addition and ligation of a preformed construe;, obtained after 
ligation of equimolar amounts of oligonucleotides DEGEN-1 and DEGEN-2, 
facilitated by the bridging oligonucleotide BRIDGE (Figure 6). 

To complete the assembly, a fragment consisting of the prehybridized oligo- 
25 nucleotides ZLIB-4 and ZLIB-5 was added to the beads for ligation. This fragment 
encodes the second loop and the first six residues of the unaltered third helix of the Z 
domain. After completed assembly, oligonucleotides ZLIB-3 and ZLIB-5, containing 
the recognition sequences for the endonucleases Esp 3 I and Nhe I respectively, were 
used as primers for PCR amplification of the assembled constructs using one tenth of 
30 the bead-immobilized ssDNA as template (theoretically corresponding to 2*10 9 
protein variants). To avoid unwanted interference during the amplification, 
oligonucleotides ZLIB-2, BRIDGE and ZLIB-5 were first eluted with alkali. The 
resulting PCR product was analysed by agarose gel electrophoresis and found to be 
homogenous and of the expected size, 179 bp. 

35 

The PCR product was subcloned into the pKNl phagemid vector containing the gene 
for residues 44-58 of the wild type Z domain in frame with a truncated version of the 
fd phage coat protein 3 gene for surface display on phage panicles upon helper phage 
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superinfection of phagernid transformed £. coli cells (Lowman ex al % (1991) 
Biochemist!?, 30, 10S32-10S44) (Figure 9). In addition, the phagernid vector contains 
an interspaced in-frame cassette encoding a 5 kDa (46 aa) serum albumin binding 
region (denoted ABP) derived from streptococcal protein G (Nygren ex al % (1988) J. 
5 Mol Recognita 1, 69-74; Nilsson ex al % (1994) Eur. J. Biochem., 224, 103-108), 
enabling efficient affinity purification of produced Z variants devoid of their native 
Fc-binding activity. Furthermore, the serum albumin binding activity can potentially 
be used for pre-selection of phage panicles carrying recombinant molecules, prior to 
the panning for Z variants with new binding functions, to decrease the background 
10 originating from unspecificallv bound non-recombinant phage particles. 

After transformation, PCR screening (using the oligonucleotides RIT-27 and NOKA- 
2) of 25 clones showed that over 957c (24/25) of the clones contained an insert of the 
expected length, suggesting that the gene assembly procedure was carried out with 

15 high efficiency. Fonyfive transformants were randomly selected and subjected to 
direct solid phase DNA sequencing (see Example 3) in order to further analyze the 
quality and heterogeneity of the library. Approximately 69% of the clones were 
correct, containing wild type and degenerate codons at expected positions. The 
remaining clones had spurious discrepancies which in part can be attributed to the 

20 oligonucleotide synthesis or errors introduced during PCR. The correct clones (31 
clones) (Figure 14) were further analyzed for codon representation at the 13 
degenerate positions. The distribution of the total 403 resulting deduced amino acids 
among the 32 codons included in the NNK degeneracy profile shows a close 
correlation with the expected frequencies for these yet unselected clones (Figure 15). 

25 To investigate the expression and stability of the Z- variants, four clones (no. 16, 21, 
22, 24; Figure 14) with different degrees of substitution as well as the wild type Z 
domain were produced as ABP fusions encoded from their respective phagernid 
vectors. Soluble proteins from the periplasm of IPTG-induced cultures were subjected 
to HSA-affinity chromatography employing the ABP-tail for general and efficient 

30 recovery' (Nygren ex al % (19SS) J. Mol. Recognita 1, 69-74). For all proteins appr. 
1.5-2.5 mg/L culture could be recovered, indicating similar production and secretion 
efficiencies for the variants and the wild type domain. The results from a SDS-PAGE 
analysis (Figure 16) of purified proteins suggest that the four Z variants analyzed are 
stably expressed in E. coli. The smaller band with HSA-binding activity, seen with 

35 different intensities most probably corresponds to the ABP-tail itself (5 kDa), 
resulting from proteolytic cleavage between the Z variant and the ABP tail. 
Interestingly, both Z- variants (no. 16 and 22) with introduced cysteine residues 
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formed dimers, which could be observed under non-reducing conditions during SDS- 
PAGE (Figure 13; lanes 6 and 7). 

To investigate if the secondary structure content of the derivatives was preserved 
5 after the extensive surface mutagenesis, a subtractive circular dichroism analysis was 
performed (see example 3). A comparison of signals obtained from 250.to 184 nm for 
the wild type Z domain and the four variants fused to the ABP-tail was performed 
after subtraction of the contribution from the ABP tail itself. The result showed that 
for three of the four derivatives spectra similar to the wild type Z domain were 

10 obtained, with a characteristic minimum at 208 nm and an inflexion point at 222 nm 
(Johnson, (1990) Prot. Struct. Fund. Genet., 7, 205-224) (Figure 17). This suggests 
that the three helix bundle framework probably is preserved in these mutants. 
However, for the fourth derivative (no. 24), a spectrum was obtained which resembles 
spectra seen for random coils, indicating a low content of secondary structure 

15 elements (Johnson, 1990). This derivative contains a glutamine to proline substitution 
at position 32 in helix 2, suggesting a destabilization leading to a collapse of the helix 
bundle framework. 

In order to further investigate the four Z-variants, the interaction with polyclonal 
20 human IgG (hlgG) (Pharmacia AB) for wild type Z and four different Z variant 
clones (no. 16, 21, 22, 24; Figure 14) fused to the ABP tail were compared using 
biosensor technology (BlAcore™, Pharmacia Biosensor AB, Sweden). The 
carboxylated dextran layer of a CM-5 sensor chip was activated using N- 
hydroxysuccinimide (NHS) and A^-ethyl-^'.[3-diethylaininopropyl]-carbodiimide 
25 (EDC) chemistry according to the manufacturers 1 recommendations. For immo- 
bilization of hlgG, 20 pi of a 500 nM hlgG solution in 50 mM acetate. pH 4 was 
injected at a flow rate of 5 Lil/min over the activated surface, resulting in the 
immobilization of approximately 5000 resonance units (RU). Fonyfive-microlitre 
samples of the five fusion proteins, dissolved to approximate concentrations of 1500 
30 nM in NaCl/Hepes (10 mM Hepes, pH 7.4, 150 mM Nad. 3.4 mM EDTA, 0.5% 
surfactant P-20), were injected in separate experiments at a flow rate of 2 ul/min. 
After each sample injection, the hlgG surface was regenerated with 20 mM HC1. As 
expected, only the wild type Z-domain showed any detectable Fc-binding activity 
(Figure 18). 

35 
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In conclusion, the results show that a library of SPA variants with a substituted 
surface made up from 13 residues located in the a-helices can be constructed. The 
high degree of conservation of the overall framework of the native Z-domain suggests 
that derivatives with novel functions grafted onto a stable and soluble scaffold could 
be isolated for use as artificial antibodies in biochemistry, immunology and 
biotechnology. 
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31 
CLAIMS 

« 

1. Novel proteins obtainable by mutagenesis of 
surface-exposed amino acids of domains of natural bacte- 

5 rial receptors, said proteins being obtained without sub- 
stantial loss of basic structure and stability of said 
natural bacterial receptors. 

2. Proteins according to claim 1, which have been 
selected from a protein library embodying a repertoire of 

10 said novel proteins. 

3. Proteins according to claim 1 or 2, which are 
fused to a phage-coat protein, and wherein said bacterial 
receptors preferably originate from Gram-positive bacte- 
ria. 

15 4. Proteins according to claim 3, wherein said bacte- 

rial receptors originate from bacteria selected from 
Staphylococcus aureus. Streptococcus pyogenes [group A], 
Streptococcus group C,G,L, bovine group G streptococci, 
Streptococcus zooepidemicus [group C] , Streptococcus 

20 zooepidemicus S212 r Streptococcus pyogenes [group A], 
streptococci groups A,C,G, Peptostreptococcus magnus, 
Streptococcus agalactlae [group B] . 

5. Proteins according to claim 4, wherein said bac- 
terial receptors originate from staphylococcal protein A 

25 or streptococcal protein G. 

6. Proteins according to claim 4 or 5, wherein said 
bacterial receptors originate from receptors selected from 
Fc[ IgG] receptor type I, type II, type III, type IV f type V 
and type VI, fibronectin receptor, M protein, plasmin re- 

30 ceptor, collagen receptor, fibrinogen receptor or protein 
L [K light chains], protein H [human IgG] , protein B 
[human IgA,Al], protein Arp [human IgA] . 

7. Proteins according to claim 5, wherein said recep- 
tor originates from the Fc[ IgG] receptor type I of staphy- 

35 lococcal protein A or the serum albumin receptor of strep- 
tococcal protein G. 
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8. Proteins according to claim 7, wherein said recep- 
tor originates from the IgG-binding domains Z, CI, and the 
serum albumin binding domains B2A3. 

9. Proteins according to. any preceding claim, wherein 
5 said substitution involves at most about 50% of the amino 

acid residues of the original bacterial receptor. 

10- Proteins according to claim 9, wherein said sub- 
stitution involves at most about 25% of the amino acid re- 
sidues of the original bacterial receptor. 
10 11. Proteins according to any of claims 1 to 7, 

wherein said substitution involves at most substantially 
all of the amino acid residues taking part in the interac- 
tion function of the original bacterial receptor. 

12. Proteins according to any preceding claim, where- 
15 in said substitution has been obtained by site-directed 

mutagenesis. 

13. Proteins according to any preceding claim, where- 
in said substitution is directed to creating specific in- 
teraction capacity vis-a-vis substances selected from pro- 

20 teins, lipids, carbohydrates and inorganic substances as 
interaction partners. 

14. Proteins according to claim 13, wherein said sub- 
stances are selected from carbohydrates, such as blood 
group determinants, and pathogen specific oligosaccharides 

25 as interaction partners. 

15. Proteins according to claim 13, wherein said sub- 
stances are selected from IGF-1, IGF- II, hGH, Factor VIII, 
insulin and apolipoprotein and their respective receptors 
as interaction partners. 

30 16. Proteins according to claim 13, wherein said sub- 

stitution is directed to creating specific interaction 
capacity vis-a-vis substances selected from viral coat 
proteins, bacterial antigens, biotin, and cell markers, 
such as CD 34, CD 4. 

35 17. Proteins according to claim 13, wherein said sub- 

stitution is directed to creating specific interaction 
capacity vis-a-vis antibody fragments, such as Fv, scFv, 
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Fab, and Fc. 

18. Proteins according to claim 13, wherein said sub- 
stitution is directed to creating specific interaction 
capacity vis-a-vis organic ligands. 
5 19. A method for the manufacture of an artificial 

bacterial receptor structure comprising the steps: 

a) subjecting the repertoire of novel receptor struc- 
tures obtained according to claim 1 to a selection proce- 
dure based on a desired interaction function; and 

10 b) isolating the selected receptor structure. 

20. A method according to claim 19 , comprising the 
steps: 

al) preparing, by recombinant DNA techniques, phage 
particles carrying on their respective surfaces proteins 
15 from said repertoire of novel receptor structures and 
fused to phage-coat proteins; 

a2) panning from a pool of phage particles resulting 
from step al ) to select specific phage clones displaying 
desired binding characteristics; and 
20 b) isolating said specific phage clones using inter- 

actions associated with said binding characteristics. 

21. A method according to claim 19 for selection of 
receptor structures associated with protein of a non- 
secretory nature, comprising the steps: 

25 a) preparing, by recombinant DNA techniques, fusion 

proteins, wherein the proteins of said library are fused 
to a repressor protein with affinity for a specific plas- 
mid-borne operator region resulting in interaction between 
a specific protein variant and a plasmid encoding the 

30 same; and 

b) isolating selected proteins using said interac- 
tion. 

22. A method according to claim 19, comprising the 
steps: 

35 al) preparing, by recombinant DNA techniques, bacte- 

rial cells carrying on their respective surfaces proteins 
from said repertoire of novel receptor structures and 
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fused to cell-wall anchoring domains functional in said 
bacterial cells; 

a2) panning from a pool of bacterial cells resulting 
from step al ) to select specific bacterial clones display- 
ing desired binding characteristics; and 

b) isolating said specific clones using interactions 
associated with said binding characteristics. 
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